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Abstract: In the principles-and-parameters model of language, the principle 
known as 'free indexation' plays an important part in the process of determining 
the referential properties of elements such as anaphors and pronominals. This 
paper addresses two issues. (1) We investigate the combinatorics of free index- 
ation. By relating the problem to the n-set partitioning problem, we show that 
free indexation must produce an exponential number of referentially distinct 
phrase structures given a structure with n (independent) noun phrases. (2) 
We introduce an algorithm for free indexation that is defined compositionally 
on phrase structures. We show how the compositional nature of the algorithm 
makes it possible to incrementally interleave the computation of free indexa- 
tion with phrase structure construction. Additionally, we prove the algorithm 
to be an 'optimal' procedure for free indexation. More precisely, by relating 
the compositional structure of the formulation to the combinatorial analysis, 
we show that the algorithm enumerates precisely all possible indexings, without 
duplicates. 
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1 Free Indexation 

Consider the ambiguous sentence: 

(1) John believes Bill will identify him 

In (1), the pronominal "him" can be interpreted as being coreferential with 
"John", or with some other person not named in (1), but not with "Bill". We 
can represent these various cases by assigning indices to all noun phrases in a 
sentence together with the interpretation that two noun phrases are coreferential 
if and only if they are coindexed, that is, if they have the same index. Hence 
the following indexings represent the three coreference options for pronominal 
"him": 1 

(2) a. Johnx believes Bill 2 will identify him x 

b. Johni believes BHI2 will identify hiiri3 

c. *Johni believes BHI2 will identify him2 

In the principles-and-parameters framework (Chomsky [3]), once indices 
have been assigned, general principles that state constraints on the locality 
of reference of pronominals and names (e.g. "John" and "Bill") will conspire 
to rule out the impossible interpretation (2c) while, at the same time, allow 
the other two (valid) interpretations. The process of assigning indices to noun 
phrases is known as "free indexation," which has the following general form: 

(4) Assign indices freely to all noun phrases. 2 

In such theories, free indexation accounts for the fact that we have coreferential 
ambiguities in language. Other principles interact so as to limit the number 
of indexings generated by free indexation to those that are semantically well- 
formed. 



'Note that the indexing mechanism used above is too simplistic a framework to handle 
binding examples involving inclusion of reference such as: 

(3) a. Wej think that Ii will win 

b. Wei think that I2 will win 

c. *Wei like myselfj 

d. John told Bill that they should leave 

Richer schemes that address some of these problems, for example, by representing indices as 
sets of numbers, have been proposed. See Lasnik [9] for a discussion on the limitations of, and 
alternatives to, simple indexation. Also, Higginbotham [7] has argued against coindexation 
(a symmetric relation), and in favour of directed links between elements (Unking theory). In 
general, there will be twice as many possible 'Unkings' as indexings for a given structure. 
However, note that the asymptotic results of Section 2 obtained for free indexation wiU also 
hold for linking theory. 

2 The exact form of (4) varies according to different versions of the theory. For example, 
in Chomsky [4] (pg.59), free indexation is restricted to apply to A-positions at the level of 
S-structure, and to A-positions at the level of logical form. 



In theory, since the indices are drawn from the set of natural numbers, there 
exists an infinite number of possible indexings for any sentence. However, we 
are only interested in those indexings that are distinct with respect to semantic 
interpretation. Since the interpretation of indices is concerned only with the 
equality (and inequality) of indices, there are only a finite number of semanti- 
cally different indexings. 3 For example, "Johni likes Mary 2 " and "John23 likes 
Mary 4 " are considered to be equivalent indexings. Note that the definition in (4) 
implies that "John believes Bill will identify him" has two other indexings (in 
addition to those in (2)): 

(5) a. *Johni believes Billi will identify himi 
b. *Johni believes Billi will identify hini2 



In some versions of the theory, indices are only freely assigned to those noun 
phrases that have not been coindexed through a rule of movement (Move-a). 
(see Chomsky [3] (pg.331)). For example, in "Whoi did John see [jvp<]i?", the 
rule of movement effectively stipulates that "Who" and its trace noun phrase 
must be coreferential. In particular, this implies that free indexation must not 
assign different indices to "who" and its trace element. For the purposes of free 
indexation, we can essentially 'collapse' these two noun phrases, and treat them 
as if they were only one. Hence, this structure contains only two independent 
noun phrases. 4 

2 The Combinatorics of Free Indexation 

In this section, we show that free indexation generates an exponential number 
of indexings in the number of independent noun phrases in a phrase structure. 
We achieve this result by observing that the problem of free indexation can be 
expressed in terms of a well-known combinatorial partitioning problem. 

Consider the general problem of partitioning a set of n elements into m non- 
empty (disjoint) subsets. For example, a set of four elements {w,x,y,z} can be 
partitioned into two subsets in the following seven ways: 



{w,x,y}{z} {w,x}{y,z} 

{w,x,z){y} {w,y){x,z} 

{w,y,z}{x} {w,z}{x,y} 
{x,y,z}{w} 



3 In other words, there are only a finite number of equivalence classes on the relation 'same 
coreference relations hold. ' This can easily be shown by induction on the number of indexed 
elements. 

^Technically, "who" and its trace are said to form a chain. Hence, the structure in question 
contains two distinct chains. 



The number of partitions obtained thus is usually represented using the nota- 
tion {^} (Knuth [8]). In general, the number of ways of partitioning n elements 
into m sets is given by the following formula. (See Purdom & Brown [10] for a 
discussion of (6).) 



(6) 



for n.m > 



The number of ways of partitioning n elements into zero sets, {£}, is defined 
to be zero for n > and one when n = 0. Similarly, {^}, the number of ways 
of partitioning zero elements into m sets is zero for m > and one when m = 0. 

We observe that the problem of free indexation may be expressed as the 
problem of assigning 1,2, ...,n distinct indices to n noun phrases where n is 
the number of noun phrases in a sentence. Now, the general problem of assigning 
m distinct indices to n noun phrases is isomorphic to the problem of partitioning 
n elements into m non-empty disjoint subsets. The correspondence here is that 
each partitioned subset represents a set of noun phrases with the same index. 
Hence, the number of indexings for a sentence with n noun phrases is: 



(7) 



m=l ^ ' 



(The quantity in (7) is commonly known as Bell's Exponential Number 
B n \ see Berge [2].) The recurrence relation in (6) has the following solution 
(Abramowitz [1]): 



(8) 



fc=0 

Using (8), we can obtain a finite summation form for the number of index- 
ings: 

(9) 

B » = E £ ( m _*)!*!*" 

It can also be shown (Graham [6]) that B n is asymptotically equal to (10): 

(10) 

m"e m »- n -3 



where the quantity m„ is given by: 



(11) 



m n In m n = n — — 



That is, (10) is both an upper and lower bound on the number of indexings. 
More concretely, to provide some idea of how fast the number of possible in- 
dexings increases with the number of noun phrases in a phrase structure, the 
following table exhibits the values of (9) for the first dozen values of n: 
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3 A Compositional Algorithm 

In this section, we will define a compositional algorithm for free indexation that 
provably enumerates all and only all the possible indexings predicted by the 
analysis of the previous section. 

The PO-PARSER is a parser based on a principles-and-parameters framework 
with a uniquely flexible architecture ([5]). In this parser, linguistic principles 
such as free indexation may be applied either incrementally as bottom-up phrase 
structure construction proceeds, or as a separate operation after the complete 
phrase structure for a sentence is recovered. The PO-PARSER was designed 
primarily as a tool for exploring how to organize linguistic principles for efficient 
processing. This freedom in principle application allows one to experiment with 
a wide variety of parser configurations. 

Perhaps the most obvious algorithm for free indexation is, first, to simply 
collect all noun phrases occurring in a sentence into a list. Then, it is easy to 
obtain all the possible indexing combinations by taking each element in the list 
in turn, and optionally coindexing it with each element following it in the list. 
This simple scheme produces each possible indexing without any duplicates and 
works well in the case where free indexing applies after structure building has 
been completed. 

The problem with the above scheme is that it is not flexible enough to deal 
with the case when free indexing is to be interleaved with phrase structure 
construction. Conceivably, one could repeatedly apply the algorithm to avoid 
missing possible indexings. However, this is very inefficient, that is, it involves 
much duplication of effort. Moreover, it may be necessary to introduce extra 



machinery to keep track of each assignment of indices in order to avoid the 
problem of producing duplicate indexings. Another alternative is to simply delay 
the operation until all noun phrases in the sentence have been parsed. (This is 
basically the same arrangement as in the non-interleaved case.) Unfortunately, 
this effectively blocks the interleaved application of other principles that are 
logically dependent on free indexation to assign indices. For example, this means 
that principles that deal with locality restrictions on the binding of anaphors 
and pronominals cannot be interleaved with structure building (despite the fact 
that these particular parser operations can be effectively interleaved). 

An algorithm for free indexation that is defined compositionally on phrase 
structures can be effectively interleaved. That is, free indexing should be de- 
fined so that the indexings for a phrase is some function of the indexings of 
its sub-constituents. Then, coindexings can be computed incrementally for all 
individual phrases as they are built. Of course, a compositional algorithm can 
also be used in the non-interleaved case. 

Basically, the algorithm works by maintaining a set of indices at each sub- 
phrase of a parse tree. 5 Each index set for a phrase represents the range of 
indices present in that phrase. For example, "Who,- did John ; see £,-?" has the 
following phrase structure and index sets: 

(12) [ C p [np Who,] [ c did [ IP [ NP John,] [ VP see [ N p <«■]]]]] 

{i,J} {i} {«',>} {iJ} {>} {•} {•} 

There are two separate tasks to be performed whenever two (or more) phrases 
combine to form a larger phrase. 6 First, we must account for the possibility that 
elements in one phrase could be coindexed (cross-indexed) with elements from 
the other phrase. This is accomplished by allowing indices from one set to 
be (optionally) merged with distinct indices from the other set. For example, 
the phrases "[jvpJohn,-]" and "[vp likes him,]" have index sets {i} and {j}, 
respectively. Free indexation must allow for the possibilities that "John" and 
"him" could be coindexed or maintain distinct indices. Cross-indexing accounts 
for this by optionally merging indices i and j. Hence, we obtain: 

(13) a. Johnj likes him,, i merged with j 

b. John,- likes himj, i not merged with j 



5 For expository reasons, we consider only pure indices. The actual algorithm keeps track of 
additional information, such as agreement features like person, number and gender, associated 
with each index. For example, irrespective of configuration, "Mary" and "him" can never have 
the same index. 

6 Some readers may realize that the algorithm must have an additional step in cases where 
the larger phrase itself may be indexed, for instance, as in [np [np John's ] mother]. In such 
cases, the third step is simply to merge the singleton set consisting of the index of the larger 
phrase with the result of cross-indexing in the first step. (For the above example, the extra 



Secondly, we must find the index set of the aggregate phrase. This is just 
the set union of the index sets of its sub-phrases after cross-indexation. In the 
example, "John likes him", (13a) and (13b) have index sets {i} and {i,j}- 

More precisely, let Ip be the set of all indices associated with the Binding 
Theory-relevant elements in phrase P. Assume, without loss of generality, that 
phrase structures are binary branching. Consider a phrase P = [p X Y] with 
immediate constituents X and Y . Then: 

1. Cross Indexing: Let Ix represent those elements of Ix which are not also 
members of Iy , that is, (Ix — Iy)- Similarly, let Iy be (Iy — Ix)- 7 

(a) If both Ix and Iy are empty sets, then done. 

(b) Let x and y be members of Ix and Iy , respectively. 

(c) Either merge indices x and y or do nothing. 

(d) Repeat from step (la) substituting Ix — {x} and Iy — {y} for Ix and 
Iy, respectively, if x and y have been merged. 

2. Index Set Propagation: Ip = Jy U Iy- 

The nondeterminism in step (lc) of cross-indexing will generate all and only 
all (i.e. without duplicates) the possible indexings. We will show this in two 
parts. First, we will argue that the above algorithm cannot generate duplicate 
indexings. That is, the algorithm only generates distinct indexings with respect 
to the interpretation of indices. As shown in the previous section, the com- 
binatorics of free-indexing indicates that there are only B n possible indexings. 
Next, we will demonstrate that the algorithm generates exactly that number 
of indexings. If the algorithm satisfies both of these conditions, then we have 
proved that it generates all the possible indexings exactly once. 

1. Consider the definition of cross-indexing. Ix represents those indices in X 
that do not appear in Y. (Similarly for Iy.) Also, whenever two indices 
are merged in step (lb), they are 'removed' from Ix and Iy before the 
next iteration. Thus, in each iteration, x and y from step (lb) are 'new' 
indices that have not been merged with each other in a previous iteration. 
By induction on tree structures, it is easy to see that two distinct indices 
cannot be merged with each other more than once. Hence, the algorithm 
cannot generate duplicate indexings. 

2. We now demonstrate why the algorithm generates exactly the correct num- 
ber of indexings by means of a simple example. Without loss of generality, 
consider the following right-branching phrase scheme: 



step is to just merge {»'} with {j}.) For expository reasons, we will ignore such cases. Note 
that no loss of generality is implied since a structure of the form [jyrPj [nPj • • ■ a •••]••• P • • ■] 
can be can always be handled as [p 1 [npJIpj [nP; - • • a •••]■■■ P ■ ■ •]]- 

7 Note that Ix and Iy are defined purely for notational convenience. That is, the algorithm 
directly operates on the elements of Ix and Iy 



NP, 




NPj NPi 



Now consider the following decision tree for computing the possible index- 
ings of the right-branching tree in a bottom-up fashion: 



NPs 
NPi 

NPj 
NP k 



Decision Tree 




i,j * k 



{»}{*»*} {hj} {hJ} {*>.?> *} 



Each node in the tree represents the index set of the combined phrase 
depending on whether the noun phrase at the same level is cross-indexed 
or not. For example, {«'} and {i, j] on the level corresponding to NPj are 
the two possible index sets for the phrase Py . The path from the root to 
an index set contains arcs indicating what choices (either to coindex or to 
leave free) must have been made in order to build that index set. Next, 
let us just consider the cardinality of the index sets in the decision tree, 
and expand the tree one more level (for NPi): 




K 



12 2 2 3 
122232232233334 



Informally speaking, observe that each decision tree node of cardinality i 
'generates' i child nodes of cardinality i plus one child node of cardinality 
i ! + 1. Thus, at any given level, if the number of nodes of cardinality m 
is c m , and the number of nodes of cardinality m — 1 is c m _i, then at the 
next level down, there will be mc m + c m _i nodes of cardinality m. Let 
c(n, m) denote the number of nodes at level n with cardinality m. Let the 
top level of the decision tree be level 1. Then: 

(14) 

c(n + 1, m + 1) = c(n, m) + (m + l)c(n, m + 1) 

Observe that this recurrence relation has the same form as equation (6). 
Hence the algorithm generates exactly the same number of indexings as 
demanded by combinatorial analysis. 

4 Conclusions 

This paper has shown that free indexation produces an exponential number of 
indexings per phrase structure. This implies that all algorithms that compute 
free indexation, that is, assign indices, must also take at least exponential time. 
In this section, we will discuss whether it is possible for a principle-based parser 
to avoid the combinatorial 'blow-up' predicted by analysis. 

First, let us consider the question whether the 'full power' of the free index- 
ing mechanism is necessary for natural languages. Alternatively, would it be 
possible to 'shortcut' the enumeration procedure, that is, to get away with pro- 
ducing fewer than B n indexings? After all, it is not obvious that a sentence with 
a valid interpretation can be constructed for every possible indexing. However, 
it turns out (at least for small values of n; see examples (15) and (16) below) 



7 To make the boundary cases match, just define c(0, 0) to be 1, and let c(0, m) = and 
c(n,0) = for to > and n > 0, respectively. 



that language makes use of every combination predicted by analysis. This im- 
plies, that all parsers must be capable of producing every indexing, or else miss 
valid interpretations for some sentences. 

There are B 3 = 5 possible indexings for three noun phrases: 8 

(15) a. Johni wanted PROi to forgive himself] (HI) 

b. Johni wanted PROi to forgive hiiri2 (H2) 

c. Johni wanted Maiy 2 to forgive himi (121) 

d. Johni wanted Mary 2 to forgive herself2 (122) 

e. Johni wanted Mary 2 to forgive hini3 (123) 

Similarly, there are fifteen possible indexings for four noun phrases: 

(16) a. Johni persuaded himselfi that hei should give himselfi up (HH) 

b. Johni persuaded Mary 2 PRO2 to forgive herself^ (1222) 

c. Johni persuaded himselfi PROi to forgive her2 (1H2) 

d. Johni persuaded Mary 2 PRO2 to forgive himi (1221) 

e. Johni persuaded Mary 2 PRO2 to forgive hini3 (1223) 

f. Johni wanted BUI2 to ask Mary 3 PRO3 to leave (1233) 

g. Johni wanted PROi to tell Mary 2 about herself2 (1122) 
h. Johni wanted Mary 2 to tell himi about himselfi (121 1) 
i. Johni wanted PROi to tell Mary 2 about himselfi (H21) 
j. Johni wanted B1II2 to tell Mary 3 about himself2 (1232) 
k. Johm wanted PROi to tell Mary 2 about Tom 3 (1123) 
1. Johni wanted Mary 2 to tell himi about Tom3 (1213) 
m. Johni wanted Mary 2 to tell Tom3 about himi (1231) 
n. Johni wanted Mary 2 to tell Tom3 about BilU (1234) 



Although it may be the case that a parser must be capable of producing 
every possible indexing, it does not necessarily follow that a parser must enu- 
merate every indexing when parsing a particular sentence. In fact, for many 
cases, it is possible to avoid exhaustively exploring the search space of possibil- 
ities predicted by combinatorial analysis. To do this, basically we must know, 
a priori, what classes of indexings are impossible for a given sentence. By fac- 
toring in knowledge about restrictions on the locality of reference of the items 
to be indexed (i.e. binding principles), it is possible to explore the space of in- 
dexings in a controlled fashion. For example, although free indexation implies 
that there are five indexings for "John thought [5 Tom forgave himself ] " , we 
can make use of the fact that "himself' must be coindexed with an element 
within the subordinate clause to avoid generating indexings in which "Tom" 
and "himself are not coindexed. 9 Note that the early elimination of ill-formed 
indexings depends crucially on a parser's ability to interleave binding principles 
with structure building. But, as discussed in Section 3, the interleaving of bind- 
ing principles logically depends on the ability to interleave free indexation with 

* PRO is an empty (non-overt) noun phrase element. 



structure building. Hence the importance of an formulation of free indexation, 
such as the one introduced in Section 3, which can be effectively interleaved. 
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